Speedata: multilingual spoken data entry
نویسندگان
چکیده
1 In this paper we present a multilingual application for speech technology. The SpeeData project aims at building a demonstrator that provides a user-friendly interface for spoken data-entry in two languages: Italian and German. The application domain is the land register of an Italian region in which both languages are o cially spoken. The considered data-entry task is particularly challenging as it considers many di erent types of data e.g. long texts, numbers, proper names, tables, etc.and a variety of of pronunciations, since dialects are present and users will not always speak in their native language.
منابع مشابه
Speedata: a prototype for multilingual spoken data-entry
In this work we describe the development and evaluation of SpeeData, a prototype for multilingual spoken data-entry. The SpeeData project aims at developing a demonstrator that provides a user-friendly interface for spoken data-entry in two languages: Italian and German. A real world application domain is considered, which is the Land Register of an Italian region in which both languages are oo...
متن کاملApplication of Speech Technologyin
In this paper we present a new application for speech technology. In the SpeeData project a user of the system will speak in any of the provided languages, which presently are the two languages German and Italian. The system will analyse the utterance and generate a data base entry. At the same time, relevant information will be translated into the other language. This paper presents an overvie...
متن کاملCorpus Based Analysis for Multilingual Terminology Entry Compounding
This paper proposes statistical analysis methods for improvement of terminology entry compounding. Terminology entry compounding is a mechanism that identifies matching entries across multiple multilingual terminology collections. Bilingual or trilingual term entries are unified in compounded multilingual entry. We suggest that corpus analysis can improve entry compounding results by analysing ...
متن کاملAn Empirical Study of Multilingual Spoken Term Detection
This paper introduces the design of multilingual spoken term detection (STD) system using CALLHOME and CALLFRIEND multilingual databases published by Linguistic Data Consortium. For our experiments seven languages namely Arabic, English, German, Japanese, Korean, Chinese Mandarin and Spanish, are used to train and evaluate the STD system. As the core module of our language general STD system, t...
متن کاملA Multilingual Spoken Dialog System
This paper will briefly introduce MSDSKIT-1 (Multilingual Spoken Dialogue System Version 1.0 developed by Kyoto Institute of Technology) which integrates Japanese and Chinese now. It is a promotion vision of the SDSKIT-3 (Spoken Dialogue System in Japanese). This system can provide services such as sight-seeing introduction, traffic guidance, hotel reservation. A user can also plan his itinerar...
متن کامل